82 research outputs found
Computational Protein Design Using AND/OR Branch-and-Bound Search
The computation of the global minimum energy conformation (GMEC) is an
important and challenging topic in structure-based computational protein
design. In this paper, we propose a new protein design algorithm based on the
AND/OR branch-and-bound (AOBB) search, which is a variant of the traditional
branch-and-bound search algorithm, to solve this combinatorial optimization
problem. By integrating with a powerful heuristic function, AOBB is able to
fully exploit the graph structure of the underlying residue interaction network
of a backbone template to significantly accelerate the design process. Tests on
real protein data show that our new protein design algorithm is able to solve
many prob- lems that were previously unsolvable by the traditional exact search
algorithms, and for the problems that can be solved with traditional provable
algorithms, our new method can provide a large speedup by several orders of
magnitude while still guaranteeing to find the global minimum energy
conformation (GMEC) solution.Comment: RECOMB 201
Recommended from our members
TITER: predicting translation initiation sites by deep learning.
MotivationTranslation initiation is a key step in the regulation of gene expression. In addition to the annotated translation initiation sites (TISs), the translation process may also start at multiple alternative TISs (including both AUG and non-AUG codons), which makes it challenging to predict TISs and study the underlying regulatory mechanisms. Meanwhile, the advent of several high-throughput sequencing techniques for profiling initiating ribosomes at single-nucleotide resolution, e.g. GTI-seq and QTI-seq, provides abundant data for systematically studying the general principles of translation initiation and the development of computational method for TIS identification.MethodsWe have developed a deep learning-based framework, named TITER, for accurately predicting TISs on a genome-wide scale based on QTI-seq data. TITER extracts the sequence features of translation initiation from the surrounding sequence contexts of TISs using a hybrid neural network and further integrates the prior preference of TIS codon composition into a unified prediction framework.ResultsExtensive tests demonstrated that TITER can greatly outperform the state-of-the-art prediction methods in identifying TISs. In addition, TITER was able to identify important sequence signatures for individual types of TIS codons, including a Kozak-sequence-like motif for AUG start codon. Furthermore, the TITER prediction score can be related to the strength of translation initiation in various biological scenarios, including the repressive effect of the upstream open reading frames on gene expression and the mutational effects influencing translation initiation efficiency.Availability and implementationTITER is available as an open-source software and can be downloaded from https://github.com/zhangsaithu/titer [email protected] or [email protected] informationSupplementary data are available at Bioinformatics online
Recommended from our members
FreePSI: an alignment-free approach to estimating exon-inclusion ratios without a reference transcriptome.
Alternative splicing plays an important role in many cellular processes of eukaryotic organisms. The exon-inclusion ratio, also known as percent spliced in, is often regarded as one of the most effective measures of alternative splicing events. The existing methods for estimating exon-inclusion ratios at the genome scale all require the existence of a reference transcriptome. In this paper, we propose an alignment-free method, FreePSI, to perform genome-wide estimation of exon-inclusion ratios from RNA-Seq data without relying on the guidance of a reference transcriptome. It uses a novel probabilistic generative model based on k-mer profiles to quantify the exon-inclusion ratios at the genome scale and an efficient expectation-maximization algorithm based on a divide-and-conquer strategy and ultrafast conjugate gradient projection descent method to solve the model. We compare FreePSI with the existing methods on simulated and real RNA-seq data in terms of both accuracy and efficiency and show that it is able to achieve very good performance even though a reference transcriptome is not provided. Our results suggest that FreePSI may have important applications in performing alternative splicing analysis for organisms that do not have quality reference transcriptomes. FreePSI is implemented in C++ and freely available to the public on GitHub
From Static to Dynamic Structures: Improving Binding Affinity Prediction with a Graph-Based Deep Learning Model
Accurate prediction of the protein-ligand binding affinities is an essential
challenge in the structure-based drug design. Despite recent advance in
data-driven methods in affinity prediction, their accuracy is still limited,
partially because they only take advantage of static crystal structures while
the actual binding affinities are generally depicted by the thermodynamic
ensembles between proteins and ligands. One effective way to approximate such a
thermodynamic ensemble is to use molecular dynamics (MD) simulation. Here, we
curated an MD dataset containing 3,218 different protein-ligand complexes, and
further developed Dynaformer, which is a graph-based deep learning model.
Dynaformer was able to accurately predict the binding affinities by learning
the geometric characteristics of the protein-ligand interactions from the MD
trajectories. In silico experiments demonstrated that our model exhibits
state-of-the-art scoring and ranking power on the CASF-2016 benchmark dataset,
outperforming the methods hitherto reported. Moreover, we performed a virtual
screening on the heat shock protein 90 (HSP90) using Dynaformer that identified
20 candidates and further experimentally validated their binding affinities. We
demonstrated that our approach is more efficient, which can identify 12 hit
compounds (two were in the submicromolar range), including several newly
discovered scaffolds. We anticipate this new synergy between large-scale MD
datasets and deep learning models will provide a new route toward accelerating
the early drug discovery process.Comment: totally reorganize the texts and figure
Observation of nonrelativistic plaid-like spin splitting in a noncoplanar antiferromagnet
Spatial, momentum and energy separation of electronic spins in condensed
matter systems guides the development of novel devices where spin-polarized
current is generated and manipulated. Recent attention on a set of previously
overlooked symmetry operations in magnetic materials leads to the emergence of
a new type of spin splitting besides the well-studied Zeeman, Rashba and
Dresselhaus effects, enabling giant and momentum dependent spin polarization of
energy bands on selected antiferromagnets independent of relativistic
spin-orbit interaction. Despite the ever-growing theoretical predictions, the
direct spectroscopic proof of such spin splitting is still lacking. Here, we
provide solid spectroscopic and computational evidence for the existence of
such materials. In the noncoplanar antiferromagnet MnTe, the in-plane
components of spin are found to be antisymmetric about the high-symmetry planes
of the Brillouin zone, comprising a plaid-like spin texture in the
antiferromagnetic ground state. Such an unconventional spin pattern, further
found to diminish at the high-temperature paramagnetic state, stems from the
intrinsic antiferromagnetic order instead of the relativistic spin-orbit
coupling. Our finding demonstrates a new type of spin-momentum locking with a
nonrelativistic origin, placing antiferromagnetic spintronics on a firm basis
and paving the way for studying exotic quantum phenomena in related materials.Comment: Version 2, 30 pages, 4 main figures and 8 supporting figure
A community challenge for a pancancer drug mechanism of action inference from perturbational profile data
The Columbia Cancer Target Discovery and Development (CTD2) Center is developing PANACEA, a resource comprising dose-responses and RNA sequencing (RNA-seq) profiles of 25 cell lines perturbed with similar to 400 clinical oncology drugs, to study a tumor-specific drug mechanism of action. Here, this resource serves as the basis for a DREAM Challenge assessing the accuracy and sensitivity of computational algorithms for de novo drug polypharmacology predictions. Dose-response and perturbational profiles for 32 kinase inhibitors are provided to 21 teams who are blind to the identity of the compounds. The teams are asked to predict high-affinity binding targets of each compound among similar to 1,300 targets cataloged in DrugBank. The best performing methods leverage gene expression profile similarity analysis as well as deep-learning methodologies trained on individual datasets. This study lays the foundation for future integrative analyses of pharmacogenomic data, reconciliation of polypharmacology effects in different tumor contexts, and insights into network-based assessments of drug mechanisms of action.Peer reviewe
- …